AITopics | full model

On the Out-of-distribution Generalization of Probabilistic Image Modelling

Neural Information Processing Systems | Apr-25-2026, 01:02:03 GMT

Out-of-distribution (OOD) detection and lossless compression are two problems that can be solved by training a probabilistic model on one dataset and then evaluating likelihoods on a second dataset whose data distribution differs. By defining the generalization of probabilistic models in terms of likelihood, we show that, for image models, the OOD generalization ability is dominated by local features.

artificial intelligence, arxiv preprint arxiv, machine learning, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Efficient Knowledge Distillation from Model Checkpoints

Neural Information Processing Systems | Apr-24-2026, 08:36:06 GMT

Knowledge distillation is an effective approach to learning compact models (students) under the supervision of large, strong models (teachers). Because teacher and student performance are strongly correlated empirically, it is commonly believed that a high-performing teacher is preferable. Consequently, practitioners tend to use a well-trained network, or an ensemble of such networks, as the teacher. In this paper, we observe that an intermediate model, i.e., a checkpoint in the middle of the training procedure, often serves as a better teacher than the fully converged model, although the former has much lower accuracy. More surprisingly, a weak snapshot ensemble of several intermediate models from the same training trajectory can outperform a strong ensemble of independently trained and fully converged models when they are used as teachers. We show that this phenomenon can be partially explained by the information bottleneck principle: the feature representations of intermediate models can have higher mutual information regarding the input, and thus contain more "dark knowledge" for effective distillation. We further propose an optimal intermediate teacher selection algorithm based on maximizing the total task-related mutual information. Experiments verify its effectiveness and applicability.
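As a rough illustration of the setting described in this abstract, the sketch below averages the softened predictions of several checkpoints from one training run and scores a student against them with a standard temperature-scaled KL distillation loss. It is a generic distillation loss, not the paper's teacher selection algorithm. The function names, the example logits, and the temperature value are all assumptions made for illustration.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-softened softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def ensemble_soft_targets(checkpoint_logits, temperature=1.0):
    """Average the softened predictions of several teacher checkpoints.

    `checkpoint_logits` holds one logit vector per checkpoint for the same
    input (hypothetical values; a real pipeline would run each checkpoint).
    """
    probs = [softmax(logits, temperature) for logits in checkpoint_logits]
    count = len(probs)
    return [sum(p[k] for p in probs) / count for k in range(len(probs[0]))]


def distillation_loss(student_logits, teacher_probs, temperature=1.0):
    """KL(teacher || student) on softened outputs, scaled by temperature^2."""
    student_probs = softmax(student_logits, temperature)
    kl = sum(
        t * math.log(t / s)
        for t, s in zip(teacher_probs, student_probs)
        if t > 0
    )
    return kl * temperature ** 2


# Two hypothetical intermediate checkpoints scoring the same input.
checkpoints = [[2.0, 1.0, 0.1], [1.5, 1.2, 0.3]]
targets = ensemble_soft_targets(checkpoints, temperature=4.0)
loss = distillation_loss([1.0, 1.0, 1.0], targets, temperature=4.0)
```

The loss is zero when the student's softened distribution matches the averaged teacher distribution and grows as the two diverge.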

artificial intelligence, machine learning, teacher model, (18 more...)

Neural Information Processing Systems

Genre: Research Report (0.46)

Industry: Education (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

ETNet: Error Transition Network for Arbitrary Style Transfer

Chunjin Song, Zhijie Wu, Yang Zhou, Minglun Gong, Hui Huang

Neural Information Processing Systems | Feb-13-2026, 22:20:57 GMT

The proposed model improves over the state-of-the-art methods with better semantic structures and more adaptive style pattern details.

artificial intelligence, machine learning, style transfer, (17 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report > Promising Solution (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Transferable Boltzmann Generators

Neural Information Processing Systems | Feb-13-2026, 00:35:46 GMT

The generation of equilibrium samples of molecular systems has been a longstanding problem in statistical physics.

artificial intelligence, dipeptide, machine learning, (12 more...)

Neural Information Processing Systems

Genre: Research Report (0.67)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.48)
Education (0.46)
Government (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

2ae6b2bdf3a179e3e24129e2c54bd871-Paper-Conference.pdf

Neural Information Processing Systems | Feb-10-2026, 02:59:53 GMT

original performance 0, performance 0, performance aal, (16 more...)

Neural Information Processing Systems

Country: North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.58)

6fac9e316a4ae75ea244ddcef1982c71-Supplemental-Conference.pdf

Neural Information Processing Systems | Feb-9-2026, 17:06:19 GMT

consistency, efficiency gain, prediction, (16 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.68)
Information Technology > Artificial Intelligence > Machine Learning (0.49)

20fdaf67581e6d7157376d1ed584040a-Paper-Conference.pdf

Neural Information Processing Systems | Feb-9-2026, 08:34:19 GMT

edge pruning, experiment, sparsity, (13 more...)

Neural Information Processing Systems

Country:

South America > Colombia > Meta Department > Villavicencio (0.04)
Europe > Latvia > Lubāna Municipality > Lubāna (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Vision (0.68)

1f88c7c5d7d94ae08bd752aa3d82108b-Paper.pdf

Neural Information Processing Systems | Feb-7-2026, 18:54:16 GMT

arxiv preprint arxiv, likelihood, nelloc, (14 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Efficient Knowledge Distillation from Model Checkpoints

Neural Information Processing Systems | Dec-27-2025, 15:56:16 GMT

In this paper, we observe that an intermediate model, i.e., a checkpoint in the middle of the training procedure, often serves as a better teacher than the fully converged model, although the former has much lower accuracy.

information, intermediate model, teacher model, (15 more...)

Neural Information Processing Systems

Country: Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.46)

Industry: Education (0.96)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Induced Model Matching: Restricted Models Help Train Full-Featured Models

Neural Information Processing Systems | Dec-26-2025, 08:07:29 GMT

We consider scenarios where a very accurate (often small) predictive model using restricted features is available when training a full-featured (often larger) model. This restricted model may be thought of as "side information", and can come either from an auxiliary dataset or from the same dataset by forcing the restriction. How can the restricted model be useful to the full model? To answer this, we introduce a methodology called Induced Model Matching (IMM). IMM aligns the context-restricted, or induced, version of the large model with the restricted model.

artificial intelligence, induced model matching, machine learning, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.84)
